Lasso tree for cancer staging with survival data.

Authors

  • Yunzhi Lin
  • Sijian Wang
  • Richard J Chappell
Abstract

The tumor-node-metastasis (TNM) staging system has been the lynchpin of cancer diagnosis, treatment, and prognosis for many years. For meaningful clinical use, an orderly grouping of the T and N categories into a staging system needs to be defined, usually with respect to a time-to-event outcome. This can be reframed as a model selection problem with respect to features arranged on a partially ordered two-way grid, and a penalized regression method is proposed for selecting the optimal grouping. Instead of penalizing the L1-norm of the coefficients as the lasso does, we place L1 constraints on the differences between neighboring coefficients in order to enforce the stage grouping. The underlying mechanism is the sparsity-enforcing property of the L1 penalty, which forces some estimated coefficients to be the same and hence leads to stage grouping. Partial ordering constraints are also required, as both the T and N categories are ordinal. A series of optimal groupings with different numbers of stages can be obtained by varying the tuning parameter, which gives a tree-like structure offering a visual aid on how the groupings are progressively made. We hence call the proposed method the lasso tree. We illustrate the utility of our method by applying it to the staging of colorectal cancer using survival outcomes. Simulation studies are carried out to examine the finite sample performance of the selection procedure. We demonstrate that the lasso tree is able to give the right grouping with a moderate sample size, is stable with regard to changes in the data, and is not affected by random censoring.
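
The penalty and constraints described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the layout of the coefficient grid (rows as T categories, columns as N categories), and the rule of reading stages off as distinct coefficient values are all assumptions made for illustration, and the coefficients are invented rather than estimated from survival data.

```python
def fused_grid_penalty(beta):
    """Sum of |differences| between vertically and horizontally adjacent cells."""
    rows, cols = len(beta), len(beta[0])
    total = 0.0
    for i in range(rows):
        for j in range(cols):
            if i + 1 < rows:  # neighbor with the next T category
                total += abs(beta[i + 1][j] - beta[i][j])
            if j + 1 < cols:  # neighbor with the next N category
                total += abs(beta[i][j + 1] - beta[i][j])
    return total


def is_partially_ordered(beta, tol=1e-9):
    """True if coefficients never decrease along either axis of the grid."""
    rows, cols = len(beta), len(beta[0])
    for i in range(rows):
        for j in range(cols):
            if i + 1 < rows and beta[i + 1][j] < beta[i][j] - tol:
                return False
            if j + 1 < cols and beta[i][j + 1] < beta[i][j] - tol:
                return False
    return True


def stages_from_coefficients(beta, decimals=6):
    """Assumed rule: cells sharing a coefficient value share a stage (1 = lowest)."""
    levels = sorted({round(b, decimals) for row in beta for b in row})
    rank = {level: k + 1 for k, level in enumerate(levels)}
    return [[rank[round(b, decimals)] for b in row] for row in beta]


# Hypothetical 3x3 grid of log-hazard coefficients (made-up values).
beta = [
    [0.0, 0.0, 1.0],
    [0.0, 1.0, 1.0],
    [2.0, 2.0, 2.0],
]
print(fused_grid_penalty(beta))        # penalty term before scaling by the tuning parameter
print(is_partially_ordered(beta))      # ordering constraint check
print(stages_from_coefficients(beta))  # stage label per (T, N) cell
```

In the full method this penalty would be scaled by the tuning parameter and added to a survival-model loss; increasing the tuning parameter drives more neighboring differences to zero, which merges cells into fewer stages and produces the tree-like sequence of groupings.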

Similar resources

A Tentative Staging of Multiple Myeloma by Utilizing Respective Coefficients of Prognostic Factors

Introduction: Multiple myeloma is a heterogeneous disease with different survival times among patients. Accurate prediction of prognosis in multiple myeloma is essential, as patients with a shorter survival time may require early bone marrow transplantation (BMT) and more advanced chemotherapy as a part of their first-line treatment. In the present study, a parameter, depicted by ga...

Triage of Limited Versus Extensive Disease on 18F-FDG PET/CT Scan in Small Cell lung Cancer

Objective(s): Small cell lung cancer (SCLC) is an aggressive neuroendocrine carcinoma, which accounts for 10-15% of pulmonary cancers and exhibits early metastatic spread. This study aimed to determine the added value of 18F-FDG PET/CT imaging in tumor, node, and metastasis (TNM) staging of SCLC, compared to the conventional computed tomography (CT) scan and its potential role as a prognosticat...

Using data mining techniques for predicting the survival rate of breast cancer patients: a review article

This review was conducted between December 2018 and March 2019 at Isfahan University of Medical Sciences. A review of various studies revealed what data mining techniques to predict the probability of survival, what risk factors for these predictions, what criteria for evaluating data mining techniques, and finally what data sources for it have been used to predict the surv...

Extracting Predictor Variables to Construct Breast Cancer Survivability Model with Class Imbalance Problem

Application of data mining methods as a decision support system has a great benefit to predict survival of new patients. It also has a great potential for health researchers to investigate the relationship between risk factors and cancer survival. But due to the imbalanced nature of datasets associated with breast cancer survival, the accuracy of survival prognosis models is a challenging issue...

Journal:
  • Biostatistics

Volume 14, Issue 2

Publication date: 2013